O 

(N 

00 



Linear vs. Semidefinite Extended Formulations: 
Exponential Separation and Strong Lower Bounds 

Samuel Fiorini*^, Serge Massar^"^, Sebastian Pokutta^, Hans Raj Tiwary-'-^, and Ronald 

de Wolf§4 

"'^Department of Mathematics, Universite libre de Bruxelles CP 216, Boulevard du Triomphe, 1050 Brussels, 

Belgium. Email: {sfiorini, htiwary}@ulb. ac.be 
■^Department of Mathematics, Friedrich-Alexander-Universitat Erlangen-Niirnberg, Am Weichselgarten 9, 91058 
Erlangen, Germany. Email: sebastian.pokutta@math.uni-erlangen.de 
■^Laboratoire d'Information Quantique, Universite hbre de Bruxelles CP 225, Boulevard du Triomphe, 1050 

Brussels, Belgium. Email: smassar@ulb.ac.be 
CWI and University of Amsterdam, Amsterdam, The Netherlands. Email: rdewolf@cwi.nl 



Q . July 19, 2012 



Abstract 

We solve a 20-year old problem posed by Yannakakis and prove that there exists no polynomial-size 
linear program (LP) whose associated polytope projects to the traveling salesman polytope, even if the 
LP is not required to be symmetric. Moreover, we prove that this holds also for the cut polytope and 
the stable set polytope. These results were discovered through a new connection that we make between 
one-way quantum communication protocols and semidefinite programming reformulations of LPs. 
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(3 ■ 1 Introduction 

Since the advent of the simplex method |13| . linear programming has become a prominent tool for solving 
optimization problems in practice. On the theoretical side, LPs can be solved in polynomial time via either 
the ellipsoid method or interior point methods |27j . 

In 1986-1987 there were attempts [l^ to prove P = NP by giving a polynomial-size LP that would solve 
the traveling salesman problem (TSP). Due to the large size and complicated structure of the proposed LP 
for the TSP; it was difficult to show directly that the LP was erroneous. In a groundbreaking effort to refute 
all such attempts, Yannakakis [54] proved that every symmetric LP for the TSP has exponential size (see [55] 
for the journal version). Here, an LP is called symmetric if every permutation of the cities can be extended 
to a permutation of all the variables of the LP that preserves the constraints of the LP. Because the proposed 
LP for the TSP was symmetric, it could not possibly be correct. 

In his paper, Yannakakis left as a main open problem the question of proving that the TSP admits no 
polynomial-size LP, symmetric or not. We solve this question by proving a super-polynomial lower bound on 
the number of inequalities in every LP for the TSP. We also prove such unconditional super-polynomial lower 
bounds for the maximum cut and maximum stable set problems. Therefore, it is impossible to prove P = NP 
by means of a polynomial-size LP that expresses any of these problems. Our approach is inspired by a close 
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connection between semidefinite programming reformulations of LPs and one-way quantum communication 
protocols that we introduce here. 

1.1 State of the Art 

Solving a Problem Through an LP A combinatorial optimization problem such as the TSP comes with 
a natural set of binary variables. When we say that an LP solves the problem, we mean that there exists 
an LP over this set of variables plus extra variables that returns the correct objective function value for all 
instances over the same set of natural variables, that is, for all choices of weights for the natural variables. 

Prom Problems to Polytopes When encoded as 0/1-points in W^, the feasible solutions of a combinato- 
rial optimization problem yield a polytope that is the convex hull of the resulting points (see Appendix [Al for 
background on polytopes). Solving an instance of the problem then amounts to optimizing a linear objective 
function over this polytope. 

For example, the TSP polytope TSP(ri,) is the convex hull of all points x £ {0, that correspond to a 
Hamiltonian cycle in the complete ri- vertex graph A"„. If we want to solve a TSP instance with edge- weights 
uiy , the goal would be to minimize X]i<j ^ij^ij for ^ £ TSP(77,). This minimum is attained at a vertex of 

the polytope, i.e., at an x G {0, l}^^) that corresponds to a Hamiltonian cycle. 

The idea of representing the set of feasible solutions of a problem by a polytope forms the basis of a 
standard and powerful methodology in combinatorial optimization, see, e.g., |44| . 

Extended Formulations and Extensions Even for polynomially solvable problems, the associated poly- 
tope may have an exponential number of facets. By working in an extended space, it is often possible to 
decrease the number of constraints. In some cases, a polynomial increase in dimension can be traded for an 
exponential decrease in the number of constraints. This is the idea underlying extended formulations. 
Formally, an extended formulation (EF) of a polytope P C R'' is a linear systeirQ 

Ex + Fy = g, y^Q (1) 

in variables {x^y) £ R''+'', where E,F are real matrices with d,r columns respectively, and g is a column 
vector, such that a; G P if and only if there exists y such that (H]) holds. The size of an EF is defined as its 
number of inequalities in the system^ Optimizing any objective function f{x) over all a; G P amounts to 
optimizing f{x) over all {x,y) S R'^^'" satisfying ([1]), provided ([1]) defines an EF of P. 

An extension of the polytope P is another polytopclfl Q such that P is the image of Q under a linear 
map. We define the size of an extension Q as the number of facets of Q. If P has an extension of size r, then 
it has an EF of size r. Conversely, it is known that if P has an EF of size r, then it has an extension of size 
at most r (see Theorem [3] below). In this sense, the concepts of EF and extension are essentially equivalent. 

The Impact of Extended Formulations EEs have pervaded the areas of discrete optimization and 
approximation algorithms for a long time. For instance, Balas' disjunctive programming [5], the Sherali- 
Adams hierarchy [3^, the Lovasz-Schrijver closures [37jj lift-and-project [^, and configuration LPs are all 
based on the idea of working in an extended space. Recent surveys on EFs in the context of combinatorial 
optimization and integer programming are |12[ 1501 1251 153| . 

^Wc restrict ourselves to linear systems in slack form, that is, containing only equalities and nonncgativity inequalities. 
Notice that this can be done without loss of the generality, since any extended formulation can be brought in slack form without 
changing the number of inequalities. 

^Another possible definition of size is the sum of the number of variables and total number of constraints (equalities plus 
inequalities) defining the EF. This makes little difference because if P C R'* has an EF with r inequalities, then it has an EF 
with d + r variables, r inequalities and at most d + r equalities (see the proof of Theorem [3] below) . 

^We could allow unbounded polyhedra here, but it again makes little difference because it can be shown that every extension 
of a polytope with a minimum number of facets is also a polytope. 
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Symmetry Matters Yannakakis [5S] proved a 2^*^"^ lower bound on the size of any symmetric EF of the 
TSP polytope TSP(n) (defined above and in Section [3^ . Ahhough he remarked that he did "not think 
that asymmetry helps much", it was recently shown by Kaibel et al. (see also |3D]) that symmetry is 
a restriction in the sense that there exist polytopes that have polynomial-size EFs but no polynomial-size 
symmetric EF. This revived Yannakakis's tantalizing question about unconditional lower bounds. That is, 
bounds which apply to the extension complexity of a polytope P, defined as the minimum size of an EF of 
P. 

0/1-Polytopes with Large Extension Complexity The strongest unconditional lower bounds so far 
were obtained by Rothvofj|42j. By an elegant counting argument inspired by Shannon's theorem j45| . it 
was proved that there exist 0/1-polytopes in R'' whose extension complexity is at least 2''/^~°('^). However, 
Rothvofi's technique does not provide explicit 0/1-polytopes with an exponential extension complexity. 

The Factorization Theorem Yannakakis [SS] discovered that the extension complexity of a polytope P 
is determined by certain factorizations of an associated matrix, called the slack matrix of P, that records for 
each pair (F, v) of a facet F and vertex v, the algebraic distance of u to a hyperplane supporting F. Defining 
the nonnegative rank of a matrix M as the smallest natural number r such that M can be expressed as 
M = TU where T and U are nonnegative matrices (i.e., matrices whose elements are all nonnegative) with 
r columns (in case of T) and r rows (in case of [/), respectively, it turns out that the extension complexity 
of every polytope P is exactly the nonnegative rank of its slack matrix. 

This factorization theorem led Yannakakis to explore connections between EFs and communication com- 
plexity. Let S denote the slack matrix of the polytope P. He proved that: (i) every deterministic com- 
munication protocol of complexity k computing 5* gives rise to an EF of P of size at most 2*^, provided S 
is a 0/1-matrix; (ii) the nondeterministic communication complexity of the support matrix of S (i.e., the 
binary matrix that has 0-entries exactly where S is 0) yields a lower bound on (the base-2 logarithm of) the 
extension complexity of P, or more generally, the nondeterministic communication complexity of the support 
matrix of every nonnegative matrix M yields a lower bound on (the base-2 logarithm of) the nonnegative 
rank of A/Q 

Tighter Communication Complexity Connection Faenza et al. [IS] proved that the base-2 logarithm 
of the nonnegative rank of a matrix equals, up to a small additive constant, the minimum complexity of 
a randomized communication protocol with nonnegative outputs that computes the matrix in expectation. 
In particular, every EF of size r can be regarded as such a protocol of complexity logr -I- 0(1) bits that 
computes a slack matrix in expectation. 

The Clique vs. Stable Set Problem When P is the stable set polytope STAB(C?) of a graph G (see 
Section l3.3p . the slack matrix of P contains an interesting row-induced 0/1-submatrix that is the commu- 
nication matrix of the clique vs. stable set problem (also known as the clique vs. independent set problem): 
its rows correspond to the cliques and its columns to the stable sets (or independent sets) of graph G. The 
entry for a clique K and stable set S equals 1 — [if fl Sj. Yannakakis [55j gave an 0(log^7i) deterministic 
protocol for the clique vs. stable set problem, where n denotes the number of vertices of G. This gives a 
20(iog=«) ^ ^0(iog«) gj^e EF for STAB(G) whenever the whole slack matrix is 0/1, that is, whenever G is a 
perfect graph. 

A notoriously hard open question is to determine the communication complexity (in the deterministic 
or nondeterministic sense) of the clique vs. stable set problem. (For recent results that explain why this 
question is hard, see [211 132].) The best lower bound to this day is due to Huang and Sudakov [53]: they 

''The classical nondeterministic communication complexity of a binary communication matrix is defined as [logB], where 
B is the minimum number of monochromatic 1-rectangles that cover the matrix, see 1301 . This last quantity is also known as 
the rectangle covering bound. It is easy to see that the rectangle covering bound of the support matrix of any matrix M lower 
bounds the nonnegative rank of M (see Theorem |4] below) . 
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obtained a | logn — 0{1) lower bound^ Furthermore, they state a graph-theoretical conjecture that, if true, 
would imply a r2(log^ n) lower bound, and hence settle the communication complexity of the clique vs. stable 
set problem. Moreover it would give a worst-case n^'^'"^;") lower bound on the extension complexity of stable 
set polytopes. However, a solution to the Huang-Sudakov conjecture seems far away. 

Other Related Works Yannakakis's paper has deeply influenced the TCS community. In addition to the 
works cited above, it has inspired a whole series of papers on the quality of restricted approximate EFs, such 
as those defined by the Sherali- Adams hierarchies and Lovasz-Schrijver closures starting with [3] ([4] for the 
journal version), see, e.g., |101 l43l \T7\ fTTl \20\ [T9\ [7]. A framework for studying the power of approximate 
EFs independent of specific hierarchies has been developed in [8]. 

1.2 Contribution 

Our contribution in this paper is three-fold. 

• First, inspired by earlier work [52] , we define a 2" x 2" matrix M = M{n) and show that the nonnegative 
rank of AI is 2^^"^ because the nondeterministic communication complexity of its support matrix is 
n{n). The latter was proved by de Wolf |52j using the well-known disjointness lower bound of Razborov 
[IT] . We use the matrix M to prove a 2^(") lower bound on the extension complexity of the cut polytope 
CUT(n) (Section l3.2p . That is, we prove that every EF of the cut polytope has an exponential number 
of inequalities. Via reductions, we infer from this: (i) an infinite family of graphs G such that the 
extension complexity of the corresponding stable set polytope STAB(G') is 2^^^" \ where n denotes 
the number of vertices of G (Section 13. 3|) : (ii) that the extension complexity of the TSP polytope 
TSP(n) is 2*^^("''') (Section ISU) . 

In addition to settling simultaneously the above-mentioned open problems of Yannakakis [551 and 
Rothvofi|42|. our results provide a lower bound on the extension complexity of stable set polytopes that 
goes beyond what is implied by the Huang-Sudakov conjecture (thanks to the fact that we consider a 
different part of the slack matrix) . Although our lower bounds are strong, unconditional and apply to 
explicit polytopes that are well-known in combinatorial optimization, they have very accessible proofs. 

• Second, we generalize the factorization theorem to conic EFs, that is, reformulations of an LP through a 
conic program. In particular, this implies a factorization theorem for semidefinite EFs: the semidefinite 
extension complexity of a polytope equals the positive semidefinite rank {PSD rank) of its slack matrix. 

• Third, we generalize the tight connection between linearis EFs and classical communication complexity 
found by Faenza et al. jl6) to a tight connection between semidefinite EFs and quantum communication 
complexitj0. We show that any rank-r PSD factorization of a (nonnegative) matrix M gives rise to a 
one-way quantum protocol computing M in expectation that uses logr -I- 0(1) qubits and, vice versa, 
that any one-way quantum protocol computing M in expectation that uses q qubits results in a PSD 
factorization of M of rank 2*^. Via the semidefinite factorization theorem, this yields a characterization 
of the semidefinite extension complexity of a polytope in terms of the minimum complexity of (one-way) 
quantum protocols that compute the corresponding slack matrix in expectation. 

Then, we give a complexity logr + 0(1) quantum protocol for computing a nonnegative matrix M in 
expectation, whenever there exists a rank-r matrix N such that M is the entry-wise square of N. This 
implies in particular that every d-dimensional polytope with 0/1 slacks has a semidefinite EF of size 
0{d). 

^AU logarithms in this paper are computed in base 2. 

®In this paragraph, and later in Sections [4] and [S] an EF (in the sense of the previous section) is called a linear EF. The use 
of adjectives such as "linear", "semidefinite" or "conic" will help distinguishing the different types of EFs. 

'^After a first version of this paper appeared, 1241 Theorem 2] have used this notion of PSD rank to characterize the number 
of qubits of communication between Alice and Bob needed to generate a shared probability distribution. 
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Finally, we obtain an exponential separation between classical and quantum protocols that compute 
our specific matrix M = M(n) in expectation. On the one hand, our quantum protocol gives a rank- 
0{n) PSD factorization of M. On the other hand, the nonnegativc rank of M is 2^*^"^ because the 
nondeterministic communication complexity of the support matrix of M is 51 (n). Thus wc obtain an 
exponential separation between nonnegativc rank and PSD rank. 

Wc would like to point out that the lower bounds on the extension complexity of polytopes established in 
Section [3] were obtained by first finding an efficient PSD factorization or, equivalently, an efficient one-way 
quantum communication protocol for the matrix M = AI(n). In this sense our classical lower bounds stem 
from quantum considerations somewhat similar in style to [551 HI 12 • See [T5] for a survey of this line of work. 

We would also like to point out that some of our results in Sections S] and [5] were also obtained by 
Gouveia, Parillo and Thomas. This applies to Theorem [T4l Corollary [T6l Theorem [T9l and Corollary [20l We 
were aware of the fact that they had obtained Theorem [14] and Corollary [T6| prior to writing this paper. 
However, their proofs were not yet publicly available at that time. Theorem [THj and Corollary [^Dj were 
obtained independently, and in a different context. All their results arc now publicly available, see [22] . 

1.3 Organization 

The discovery of our lower bounds on extension complexity crucially relied on finding the right matrix M and 
the right polytope whose slack matrix contains M. In our case, we found these through a connection with 
quantum communication. However, these quantum aspects are not strictly necessary for the resulting lower 
bound proof itself. Hence, in order to make the main results more accessible to those without background 
or interest in quantum computing, we start by giving a purely classical presentation of those lower bounds. 

In Section [5] we define our matrix M and lower bound the nondeterministic communication complexity 
of its support matrix. In Section [3] we embed M in the slack matrix of the cut polytope in order to lower 
bound its extension complexity; further reductions then give lower bounds on the extension complexities of 
the stable set, and TSP polytopes. In Section |4| we state and prove the factorization theorem for arbitrary 
closed convex cones. In Section [5] we establish the equivalence of PSD factorizations of a (nonnegativc) 
matrix M and one-way quantum protocols that compute M in expectation, and give an efficient quantum 
protocol in the case where some entry-wise square root of M has small rank. This is then used to provide 
an exponential separation between quantum and classical protocols for computing a matrix in expectation 
(equivalently, an exponential separation between nonnegativc rank and PSD rank). Concluding remarks are 
given in Section refsec: concluding-remarks. 

2 A Simple Matrix with Large Rectangle Covering Bound 

In this section we consider the following 2" x 2" matrix AI = M{n) with rows and columns indexed by n-bit 
strings a and h, and real nonnegativc entries: 

Note for later reference that Mab can also be written as 

Mab = 1 - (2diag(a) - aa^hh''), (2) 

where (•, •) denotes the Frobenius inner produci|f| and diag(a) is the n x n diagonal matrix with the entries 
of a on its diagonal. Let us verify this identity, using a,6G{0,l}": 

l-(2diag(a) - aa'',bb'') 

= 1 - 2(diag(a), fefeT) + {aa'',bb'') 

= 1 - 2aT6 + (a^f = (1 - a^)^. 

®The Frobenius inner product is the component-wise inner product of two matrices. For matrices X and Y of the same 
dimensions, this equals Tr [X'^Y]. When X is symmetric this can also be written Tr [Xy], 
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Let suppniat(Af ) be the binary support matrix of M, so 



suppniat(M)ab 



1 ifMabT^O, 
otherwise. 



For a given matrix, a rectangle is the cartesian product of a set of row indices and a set of column indices. 
In j52j it was shown that an exponential number of (monochromatic) rectangles are needed to cover all the 

1- entries of the support matrix of M. Equivalently, the corresponding function / : {0, 1}" x {0, 1}" {0, 1} 
has nondcterministic communication complexity of fi(n) bits. For the sake of completeness we repeat the 
proof here: 

Theorem 1 ([521). Every 1 -monochromatic rectangle cover o/ suppmat(Af ) has size 2^^("). 

Proof. Let . . . , Rk be a 1-cover for /, i.e., a set of (possibly overlapping) 1-monochromatic rectangles in 
the matrix suppmat(M) that together cover all 1-entries in suppmat(Af). 

We use the following result from [Ml Example 3.22 and Section 4.6], which is essentially due to Razborov 

m- 

There exist sets A,B C {0, 1}" x {0, 1}" and probability distribution /z on {0, 1}" x {0, 1}" such 
that all (a, b) € A have a'^b = 0, all (a, b) £ B have a'^b = 1, fJ,{A) = 3/4, and there are constants 
a, (5 > (independent of n) such that for all rectangles i?, 

fi{Rr\B)^a- ^{R n A) - 2^'^". 

(For sufficiently large n, a = 1/135 and 6 = 0.017 will do.) 

Since the Ri are 1-rectangles, they cannot contain elements from B. Hence /^(-Ri n i?) = and ^{Ri C] A) ^ 

2- '5"/a. However, since all elements of A are covered by the Ri, we have 



- = ^l{A) = A. |J(i?, n A) U Y.^Ji{R^ r^A)^k 



) — (571 



Hence fc ^ 2'^^("). □ 



3 Strong Lower Bounds on Extension Complexity 

Here we use the matrix M — M{n) defined in the previous section to prove that the (linear) extension 
complexity of the cut polytope of the n- vertex complete graph is 2^^^"^ , i.e., every (linear) EF of this polytope 
has an exponential number of inequalities. Then, via reductions, we prove super-polynomial lower bounds 
for the stable set polytopes and the TSP polytopcs. To start, let us define more precisely the slack matrix 
of a polytope. For a matrix A, let Ai denote the ith row of A and A^ to denote the jth column of A. 

Let P = [x \ Ax ii,b} ^ conv(y) be a polytope, with A G R™^'', 6 e M"* and F = {vi, . . . , u„} C 

W^. Then S G M™""" defined as S^J := b^ - A^vj with i e [m] := {1, . . . , m} and j £ [n] := {1, . . . , n} is the 
slack matrix of P w.r.t. Ax ^ b and V. We sometimes refer to the submatrix of the slack matrix induced by 
rows corresponding to facets and columns corresponding to vertices simply as the slack matrix of P, denoted 
by SiP). 

Recall that: 

1. an extended formulation (EF) of P is a linear system in variables {x,y) such that .t G P if and only if 
there exists y satisfying the system; 

2. an extension of P is a polytope Q CM.^ such that there is a linear map tt : R'^ ^> M'' with tt{Q) = P; 

3. the extension complexity of P is the minimum size (i.e., number of inequalities) of an EF of P. 
We denote the extension complexity of P by xc(P). 
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3.1 The Factorization Theorem 

A rank-r nonnegative factorization of a (nonnegative) matrix M is a factorization M = TU where T and U are 
nonnegative matrices with r columns (in case of T) and r rows (in case of [/), respectively. The nonnegative 
rank of M (denoted by: rank_(_(Af)) is thus simply the minimum rank of a nonnegative factorization of M. 
Note that rank-|_(Af) is also the minimum r such that M is the sum of r nonnegative rank-1 matrices. In 
particular, the nonnegative rank of a matrix M is at least the nonnegative rank of any submatrix of M . 

The following factorization theorem was proved by Yannakakis (see also [Hj). It can be stated succinctly 
as: xc(P) = rank-)- (5*) whenever P is a polytope and S a slack matrix of P. We include a sketch of the proof 
for completeness and we will use the following lemma which is an easy consequence of the Farkas' Lemma 

Lemma 2. Let P = {x E M" | Ax < b} be a polytope with dim(P) > 1 and cx < 5 a valid inequality for P. 
Then there exists nonnegative multipliers X such that XA ~ c and Xb = 5, i.e., cx < S can be derived as a 
nonnegative combination from Ax < b. 

Theorem 3 ([55]). Let P ^ {x gM.'^ \ Ax ^ 6} = conv(V) be a polytope with dim(P) ^ I, and let S denote 
the slack matrix of P w.r.t. Ax ^ b and V. Then the following are equivalent for all positive integers r: 

(i) S has nonnegative rank at most r; 

(a) P has an extension of size at most r (that is, with at most r facets); 
(Hi) P has an EE of size at most r (that is, with at most r inequalities). 

(sketch). It should be clear that (ii) implies (iii). We prove that (i) implies (ii), and then that (iii) implies 
(i). 

First, consider a rank-r nonnegative factorization S = TU of the slack matrix of P. Notice that we may 
assume that no column of T is zero, because otherwise r can be decreased. We claim that P is the image of 

Q {{x, y) e M'*+" I Ax + Ty^b, y^O} 

under the projection tt^, onto the x-space. We see immediately that TTxiQ) C P since Ty ^ 0. To prove the 
inclusion P C Tr^iQ), it suffices to remark that for each point Vj G V the point {vj, [/■') is in Q since 

Avj + TW ^ Avj + b~ Avj^b and ^ 0. 

Since no column of T is zero, Q is a polytope. Moreover, Q has at most r facets, and is thus an extension 
of P of size at most r. This proves that (i) implies (ii). 
Second, suppose that the system 

Ex + Ey ^ g, y^O 

with (x, y) e R'^+'" defines an EF of P. Let Q C R'^+'' denote the set of solutions to this system. Thus Q is a 
(nonnecessarily bounded) polyhedron. For each point Vj £ V, pick Wj £ W such that {vj,Wj) € Q. Because 

Ax < 6 3y : Ex + Ey ^ g, y ^ 0, 

each inequality in Ax ^ b is valid for all points of Q. Let Sq be the nonnegative matrix that records the 
slacks of the points {vj,Wj) with respect to the inequalities of y ^ 0, and then of Ax ^ b. By construction, 
the submatrix obtained from Sq by deleting the r first rows is exactly S, thus rank-|-(5) ^ rank-i-(S'Q). 
Furthermore, by Lemma [2] any valid inequality cx < S is a, nonnegative combination of inequalities of the 
system Ax < b and thus every row of Sq is a nonnegative combination of the first r rows of Sq. Thus, 
rank4_(S'Q) < r. Therefore, rank-|_(5) ^ r. Hence (iii) implies (i). □ 
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We will prove a generalization of Theorem [3] for arbitrary closed convex cones in Section 01 but for now 
this special case is all we need. 

We would like to emphasize that we will not restrict the slack matrix to have rows corresponding only 
to the facet-defining inequalities. This is not an issue since appending rows corresponding to redundanl|j 
inequalities does not change the nonnegative rank of the slack matrix. This fact was already used in the 
second part of the previous proof (see also Lemma [TS] below) . 

Theorem [3] shows in particular that we can lower bound the extension complexity of P by lower bounding 
the nonnegative rank of its slack matrix 5; in fact it suffices to lower bound the nonnegative rank of any 
submatrix of the slack matrix S corresponding to an implied system of inequalities. To that end, Yannakakis 
made the following connection with nondeterministic communication complexity. Again, we include the 
(easy) proof for completeness. 

Theorem 4 ([55]). Let M be any matrix with nonnegative real entries and suppmat(Af) its support matrix. 
Then rank-|_(M) is lower hounded by the rectangle covering bound for suppmat(A/). 

Proof. If M ~ TU is a rank-?' nonnegative factorization of il/, then S can be written as the sum of r 
nonnegative rank-1 matrices: 

r 

fc=i 

Taking the support on each side, we find 

r 

supp(5) = U supp(T^C/fe) 
fc=i 

r 

= [j supp(T'=) X supp(C/fc). 

fc=i 

Therefore, suppmat(Af) has a 1-monochromatic rectangle cover with r rectangles. □ 
3.2 Cut and Correlation Polytopes 

Let Kn — {Vn,En) dcnotc the n-vertex complete graph. For a set X of vertices of Kn, we let 5{X) denote 
the set of edges of Kn with one endpoint in X and the other in its complement X. This set 6{X) is known 
as the cut defined by X. For a subset F of edges of Kn, we let G M.^" denote the characteristic vector 
of F, with Xe = 1 if e £ F and xf = otherwise. The cut polytope CUT(n) is defined as the convex hull of 
the characteristic vectors of all cuts in the complete graph Kn = {VmEn). That is, 

CUT(n) conv{x*^^' e E^" | X C K}- 

We will not deal with the cut polytopes directly, but rather with / 1-polytopes that are linearly isomorphic 
to them. The correlation polytope (or Boolean quadric polytope) COR(rt) is defined as the convex hull of all 
the rank-1 binary symmetric matrices of size n x n. In other words, 

COR(n) conv{66T e M"^" \ be{0, 1}"}. 

We use the following known result: 

Theorem 5 ([47|). For all n, COR(n) is linearly isomorphic to CUT(n + 1). 

Because M is nonnegative, Eq. ([2]) gives us a linear inequality that is satisfied by all vertices bb'^ of 
COR(n), and hence (by convexity) is satisfied by all points of COR(n): 

''An inequality of a linear system is called redundant if removing the inequality from the system does not change the set of 
solutions. 
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Lemma 6. For all a G {0, 1}", the inequality 

{2diag(a) - aaT,a;) ^ 1 (3) 

is valid for COR(n). Moreover, the slack of vertex x = hV^ with respect to ([3]) is precisely Mab- 

We remark that ^ is weaker than the hypermetric inequality jl4j (diag(a) — aa'^,x) ^ 0, in the sense 
that the face defined by the former is strictly contained in the face defined by the latter. Nevertheless, we 
persist in using Now, we prove the main result of this section. 

Theorem 7. There exists some constant C > such that, for all n, 

xc(CUT(ri + 1)) = xc(COR(n)) ^ 2^^" . 
In particular, the extension complexity o/CUT(ri) is 2^^^"^. 

Proof. The equality is implied by Theorem [5l Now, consider any system of linear inequalities describing 
COR(n) starting with the 2" inequalities (|3]), and a slack matrix S w.r.t. this system and {hh^ \ b e {0, 1}"}. 
Next delete from this slack matrix all rows except the 2" first rows. By Lemma [SJ the resulting 2" x 2" 
matrix is M. Using Theorems |31 HI and[Tl and the fact that the nonnegative rank of a matrix is at least the 
nonnegative rank of any of its submatrices, we have 

xc(COR(n)) = rank+(S') 
^ rank+(M) 

for some positive constant C. □ 
3.3 Stable Set Polytopes 

A stable set S (also called an independent set) of a graph G = {V, E) is a subset S CV oi the vertices such 
that no two of them are adjacent. For a subset S V, we let € denote the characteristic vector of 
S, with = 1 if "f G 'S' and Xv ^ ^ otherwise. The stable set polytope, denoted STAB(G'), is the convex 
hull of the characteristic vectors of all stable sets in G, i.e., 

STAB(G) := conv{x^ € | 5 stable set of G}. 

Recall that a polytope Q is an extension of a polytope P if P is the image of Q under a linear projection. 

Lemma 8. For each n, there exists a graph Hn with 0{n^) vertices such that STAB(77„) contains a face 
that is an extension o/COR(7i). 

Proof. Consider the complete graph Kn with vertex set Vn '■= [n]. For each vertex i of Kn we create two 
vertices labeled ii,ii in Hn and an edge between them. For each edge ij of Kn, we add to iJ„ four vertices 
labeled ij, ij, ij, ij and all possible six edges between them. We further add the following eight edges to iJ„: 

{U, jj}, {u, m}> {ij, JJ}, 

{u, ii], la, iii, m}j in, 

See Fig. [T] for an illustration. The number of vertices in Hn is 2n + 4(2) . 

Thus the vertices and edges of Kn arc represented by cliques of size 2 and 4 respectively in _ff„. We 
will refer to these as vertex-cliques and edge-cliques respectively. Consider the face F = F(n) of STAB(7?„) 
whose vertices correspond to the stable sets containing exactly one vertex in each vertex-clique and each 
edge-clique. (The vertices in this face correspond exactly to stable sets of Hn with maximum cardinality.) 
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Figure 1: The edges and vertices of iJ„ corresponding to vertices i, j and edge ij of Kn- 

Consider the hnear map tt : M^(^") M"^" mapping a point x G M^(^") to the point y G R"^" such 
that Hij = Hji = Xij for i ^ j. In this equation, the subscripts in yij and yji refer to an ordered pair of 
elements in [n] , while the subscript in Xij refers to a vertex of iJ„ that corresponds either to a vertex of A'„ 
(if i = j) or to an edge of Kn (if i ^ j). 

We claim that the image of F under tt is COR(n), hence F is an extension of COR(ri); observe that 
is suffices to consider 0/1 vertices as _F is a 0/1 polytope and the projection is an orthogonal projection. 
Indeed, pick an arbitrary stable set S of 77„ such that x := is on face F. Then define b £ {0, 1}" by 
letting bi := 1 if ii G S and bi otherwise (i.e., ii G S). Notice that for the edge ij of Kn wc have ij G S 
if and only if both vertices ii and jj belong to S. Hence, tt{x) ^ y = bb'^ is a vertex of COR(n). This proves 
7r(F) C COR(n). Now pick a vertex y := bb'^ of COR(n) and consider the unique maximum stable set S 
that contains vertex ii lihi ~ \ and vertex ii if bi = 0. Then x := is a vertex of F with tt{x) = y. Hence, 
7r(F) 2 COR(n). Thus Tr{F) = COR(n). This concludes the proof. □ 

Our next lemma establishes simple monotonicity properties of the extension complexity used in our 
reduction. 

Lemma 9. Let P , Q and F he polytopes. Then the following hold: 

(i) if F is an extension of P, then xc(_F) ^ xc(P); 

(ii) if F is a face ofQ, then xc(Q) ^ xc(-F). 

Proof. The first part is obvious because every extension of F is in particular an extension of P. For the 
second part, notice that a slack matrix of F can be obtained from the (facet- vs- vertex) slack matrix of Q by 
deleting columns corresponding to vertices not in F. Now apply Theorem [3l □ 

Using previous results, we can prove the following result about the worst-case extension complexity of 
the stable set polytope. 

Theorem 10. For all n, one can construct a graph Gn with n vertices such that the extension complexity 
of the stable set polytope STAB(G'„) ts 2f^("'^'). 

Proof. W.l.o.g., we may assume n ^ 18. For an integer p ^ 3, let f{p) ;= \V{Hp)\ = 2^ + 4(2). Given 
n ^ 18, wc define p as the largest integer with f{p) ^ n. Now let G„ be obtained from Hp by adding 
n — f{p) isolated vertices. Then STAB(iJp) is linearly isomorphic to a face of STAB(G'„). Using Theoreni[7] 
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in combination with Lemmas [S] and IHl wc find that 



xc(STAB(G'„)) > xc(STAB(i7j,)) 
^ xc(COR(p)) 

□ 

3.4 TSP Polytopes 

RecaU that TSP(ri,), the traveling salesman polytope or TSP polytope of Kn ~ (l/„,i?„), is defined as the 
convex hull of the characteristic vectors of all subsets F C En that define a tour of Kn- That is, 

TSP(77.) := conv{x^ G M^" | C is a tour of A'„}. 

We now prove that the polytope COR(rt) is the linear projection of a face of TSP(0(n^)), implying the 
following: 

Lemma 11. For each n, there exists a graph Gn with q ~ 0{n^) vertices such that TSP{q) contains a face 
that is an extension ofCOK{n). 

Proof. Recall that 

COR(??.) = conv{6&T e E»x« | 6 e {0, 1}"}. 

To prove the lemma we start with constructing a graph Gn with 0{n^) vertices such that the tours of G„ 
correspond to the nxn rank-1 binary symmetric matrices bb'^ , where b € {0, 1}". This is done in three steps: 

(i) define a 3SAT formula (/>„ with variables such that the satisfying assignments of bijectively 
correspond to the matrices bb'^ , where b £ {0, 1}"; 

(ii) construct a directed graph D„ with 0{n^) vertices such that each directed tour of -D„ defines a satisfying 
assignment of (jim and conversely each satisfying assignment of has at least one corresponding 
directed tour in £>„; 

(iii) modify the directed graph Z)„ into an undirected graph Gn in such a way that the tours of G„ bijectively 
correspond to the directed tours of £>„ . 

Step (i). For defining 0„ we use Boolean variables Cy G {0, 1} for i,j G [n] and let 

i.je[nl 

The four clauses {Cu V Cjj V ) , {Cu V Cjj V Cij ) , {Cu V Cjj V Cij ) and [Cu V Cjj V Cij ) model the equation 
Cij = GiihCjj. Hence, C G {0, l}"'^" satisfies </>„ if and only if there exists b G {0, 1}" such that Cij ~ biAbj 
for all i,j G [n], or in matrix language, C = bb'^ . 

Step (ii). To construct a directed graph D„ whose directed tours correspond to the satisfying assignments 
of 0„ we use the standard reduction from 3SAT to HAMPATH [48] . 

We order the variables of arbitrarily and construct a gadget for each variable as follows. Suppose that 
the fcth variable occurs in p clauses. We create a chain of 3]3+ 1 nodes, labeled v^^i,. . . , Vk,3p+i, where each 
node Wfc^f with £ < 3p + 1 is connected to the next node Vk,i+i with two opposite directed edges. Figure [5] 
illustrates this. Traversing this chain from left to right is interpreted as setting the fcth variable to false and 
traversing it from right to left is interpreted as setting the fcth variable to true. We also have two nodes 
Sfc,ifc connected to this chain with directed edges (sfc,Wfc,i), (sfe, Wfc,3p+i), and (wfe,3p+i,ife) creating 

a diamond structure. 
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Figure 2: Gadget for the /cth variable occuring in p clauses. 

Next, we order the clauses of 4>n arbitrarily and create a node for each clause. The node for the mth 
clause is denoted by Wm- We connect these extra nodes to the gadgets for the variables as follows. Suppose, 
as before, that the fcth variable appears in p clauses. Consider the ^th of these clauses in which the kih. 
variable appears, and let m be the index of that clause. If the fcth variable appears negated in the mth 
clause then we add the path Wfe.3f-i, Wm, ffc,3£. Otherwise the fcth variable appears non-negated in the mth 
clause and we add the path Vk^u,Wm,Vk.u-i- Figure [3] illustrates this. 





Figure 3: Gadgets for clauses in case the fcth variable appears negated in the mth clause and non-negated 
in the m'th clause. 

Next we connect the gadgets corresponding to the variables by identifying with Sk+i for 1 ^ fc < n^. 
Finally, we add a directed edge from t„2 to si. Figure |4] illustrates the final directed graph obtained. 




Figure 4: Final graph. Directed edges incident to nodes Wn 
clauses in Boolean formula (bn- 



depend on the actual ordering of variables and 
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To see why the directed tours of the final directed graph D„ define satisfying assignments of our Boolean 
formula , observe that each directed tour of Dn encodes a truth assignment to the "n? variables depending 
on which way the corresponding chains are traversed. Because a directed tour visits every node and because 
the node Wm corresponding to a clause can be visited only if wc satisfy it, the truth assignment satisfies 0„. 
Conversely, every satisfying assignment of 0„ yields at least one directed tour in £)„. (If the mth clause is 
satisfied by the value of more than one variable, we visit Wm only once, from the chain of the first variable 
whose value makes the clause satisfied). 

Step (Hi). For each node v of D„ we create a path Vi-^, Wmid, '^out in the (undirected) graph G„. For each 
directed edge {v,w) of D„, we add to the graph G„ an edge between Vo^t and Win. As is easily seen, the 
tours of Gn bijectively correspond to the directed tours of D„. Note that Gn has q 'i{n{n — 1) • 13 + ti ■ 
(3n - 2) + rt^ + 4n(n - 1)) = 0{n'^) vertices. 

Consider the face F of TSP(g) defined by setting to all variables corresponding to non-edges of G^, 
so that the vertices of F are the characteristic vectors of the tours of G„. To conclude the proof we give a 
linear projection -k : x ^ y := ^{x) mapping F to COR(n). For x S and i, j G [n], we let yij = Xe, 
where e is the edge (wotit,fc,2, Wm.fc.i) of G„ corresponding to the directed edge {vk.2,Vk,i) and k is the index 
of the variable Gy of (/)„. It follows from the above discussion that tt maps the face F of TSP((7) to COR(n). 
The lemma follows. □ 

The final theorem in this section follows from Theorem [3 Lemmas |9] and [TlJ using an argument similar 
to that used in the proof of Theorem [TOl 

Theorem 12. The extension complexity of the TSP polytope TSP(ri) is 2^^^" ^ 



4 Conic and Semidefinite EFs 

In this section we extend Yannakakis's factorization theorem (Theorem to arbitrary closed convex cones. 
The proof of that theorem shows that, in the linear case, any EF can be brought in the form Ex + Fy = g, 
y ^ without increasing its size. This form is the basis of our generalization: we replace the nonnegativity 
constraint y ^ by a general conic constraint y G C. 

Let Q = {{x,y) G M'^+'"' | Ex + Fy = g, y £ C} for an arbitrary closed convex cone G C M}, where 
E € W'^,F G RP""", and g G M^. Let C* := {z G R'' | z^y ^ 0,Vy G G} denote the dual cone of G. We 
define the projection cone of Q as 

GQ:={/iGRn^VeG*} 

and 

proj^(g) := {x G R'^ \ fi''Ex ^ fi''g,Wfi G Cq}. 
In a first step we show that proj^((3) equals 

ttAQ) G M'' I G : {x,y) G Q}, 

the projection of Q to the x-space. 

Lemma 13. With the above notation, we have HxiQ) ~ projj,(Q). 

Proof. Let a G ttx{Q). Then there exists y £ C with Ea + Fy = g. Pick any fi G Cq. Then, ^'^ Ea + ji^ Fy = 
fi'^g holds. Since F'^ ^ G G* and y G G we have that [F"^ nYy = Fy ^ 0. Therefore jji^ Ea ^ y.'^ g holds for 
all (U G Cq. We conclude a G proj^(Q) and as a was arbitrary 7r^(Q) C proj2.(Q) follows. 

Now suppose a ^ iTxiQ). We show that this implies a ^ proj^(Q). As a ^ t^x{Q) there is no y G G 
such that Fy ~ g — Ea or, equivalently, the convex cone F{C) := {Fy \ y G G} does not contain the point 
g — Ea. Since G is a closed cone, so is F{C). Therefore, by the Strong Separation Theorem there exists 
^eW such that z > is valid for F{C) but /^^(^ _ Ea) < 0. Then z = ^i'^{Fy) = {n'^F)y ^ is valid 
for G, i.e., {^i^ F)y ^ holds for aU y & C, implying F'' fi G G* and so /i G Cq. Because fi''{g — Ea) < we 
have ^'^ Ea > ijJ g with /i G Cq so that a ^ proj^((5) follows. □ 
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Let P = {x eR"^ \ Ax iS, b} = conv(F) be a polytope, with A G R"^'*, 6 G R™ and F = {vi, . . . , u„} C 
M.'^. Then Ex + Fy = g, y G C is a cotizc EF of P whenever a: G P if and only if there exists y € C such 
that Ex + Fy — g. The set Q = {{x, y) G | Ex + Fy ~ g, y G C} is then caUed a come extension of P 

w.r.t. C. 

We now prove a factorization theorem for the slack matrix of polytopes that generalizes Yannakakis's 
factorization theorem in the linear case. Yannakakis's result can be obtained as a corollary of our result by 
taking C = M^, and using Theorem [Ml together with the fact that (M^)* = M*^. 

Theorem 14. Let P = {x e W'^ \ Ax ^ b} = conv(l/) be a polytope with dim(P) ^ 1 defined by m 
inequalities and n points respectively, and let S be the slack matrix of P w.r.t. Ax ^ b and V. Also, let 
C C R*^ be a closed convex cone. Then, the following are equivalent: 

(i) There exist T,U such that (the transpose of) each row of T is in C* , each column of U is in C, and 
S = TU. 

(a) There exists a conic extension Q ~ {{x, y) G R''^'^ | Ex + Fy = g, j/ G C} with P ~ t^x{Q)- 

Before proving the theorem, we prove a lemma which will allow us to get rid of rows of a slack matrix 
that correspond to redundant inequalities. Below, we call a factorization as in (i) a factorization of S w.r.t. 
C. 

Lemma 15. Let P C R'' be a polytope with dim(P) ^ 1, let S and S' be two slack matrices of P, and let 
C C R'' be a closed convex cone. Then S has a factorization w.r.t. C if and only if S' has a factorization 
w.r.t. C. 

Proof. It suffices to prove the theorem when S' is the submatrix of S induced by the rows corresponding 
to facet-defining inequalities and the columns corresponding to vertices, that is, when S' — S{P). One 
implication is clear: if S has a factorization w.r.t. C, then S" also because S" is a submatrix of S. 

For the other implication, consider a system Ax ^ 6 of to inequalities and a set V = {vi, ...,«„} of ?i 
points such that P = {a; G M'' | Ax ^ b} = conv(y). Assume that the / first inequalities of Ax ^ b are 
facet-defining, while the remaining m — f are not, and that the v first points of V are vertices, while the 
remaining n — v are not. 

Consider an inequality AiX ^ bi with i > f. Suppose first that the inequality is redundant. By Farkas's 
lemma (using dim(P) ^ 1), there exist nonnegative coefficients /i^^^ with k G [/] such that Ai = X]fcG[/] l^i,k^k 
and bi = f-i,kbk as P is a polytope. If the inequality is not redimdant, since it is not facet-defining, 

it is satisfied with equality by all points of P. In this case, we let /li.fc for all /c G [/]. Finally, for i ^ / 
we let /ii.fc := 1 if z = fc and /i^.fc := otherwise. 

Next, consider a point Vj with j > v. Because Vj is in P, it can be expressed as a convex combination of 
the vertices of P: Vj = X^fefu] -^i.^^^j where Xj^t with i G [v\ are nonnegative coefficients that sum up to 1. 
Similarly as above, for j ^ v we let Xj^i := 1 if j = i and ^j^i := otherwise. 

Now, let S' = TU be a factorization of S" w.r.t. C. That is, we have row vectors Ti, . . . , Tf with (Tfc)''' G 
C* (for fc G [/]) and column vectors U^, ...^U" with [/^ G C (for £ G [v]) such that bk - AkVt = S'^^ = TkU^ 
for k G [f],t G [v]. 

We extend the factorization of S" into a factorization of S by letting Ti := f-i-kTk and U'^ :~ 

Efe[ij] ^j,^^^ -^'-'^ * > ./ f-iid j > V. Given our choice of coefficients, these equations also hold for i ^ / and 
j ^ V. Clearly, each Ti (transposed) is in C* and each is in C. A straightforward computation then 
shows TiU^ = Sij for all i G [to], j G [n]. Therefore, Ti with i G [to] and with j G [n] define a factorization 
of 5* w.r.t. C. □ 

of Theorem \14\ We first show that a factorization induces a conic extension. Suppose there exist matrices 
T, U as above. We claim that Q with E -.^ A, F := T and g := b has the desired properties. Let vj G V, 
then = TU^ = b — Avj and so it follows that {vj, U^) G Q ^-nd Vj G tTx{Q)- Now let x G tTx{Q)- Then, 
there exists y & C such that Ax + Ty = b. Since Tiy ^ for all i G [m], we have that a; G P. This proves 
the first implication. 
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For the converse, suppose P = TTxiff) with Q being a conic extension of P. By Lemma [T51 Tr^iQ) = {x £ 
R'^ I n'^Ex < n'^gy^ e Cq}, where Cq = {fj. e W \ F'^^i e C*}. By Lemma [fel it suffices to prove that 
the submatrix of S induced by the rows corresponding to the inequahtics of Ax ^ b that define facets of P 
admits a factorization w.r.t. C. Thus, we assume for the rest of the proof that all rows of S correspond to 
facets of P. Then, for any facet-defining inequality AiX ^ 6^ of P there exists € Cq such that fJ,jEx ^ fijg 
defines the same facet as AiX ^ bi. (This follows from the fact that Cq is closed; see also [331 Theorem 
4.3.4].) Scaling fii if necessary, this means that fij E = Ai + and fxjg ~ bi + delta, where c^x ~ 5 is 
satisfied for all points of P. We define T,; := jijE for all i; in particular {TiY e C* as /i^ S Cq. Now let 
Vj € V. Since P = tTx{Q), there exists a j/j G C such that Evj + Eyj = g and so ^ijEvj + ^J-jEyj = ^Jg. 
With the above we have AiVj + c^Vj + TiUj ~ bi + S, hence AiVj + TiUj = hi and as Vj £ tTx{Q) we deduce 
TiUj ^ 0. The slack of Vj w.r.t. AiX ^ hi is bi — AiVj ~ ^Jg — ^jEvj — fj.jFyj = Tiyj. This implies the 
factorization S = TU with T, = fiJ F and =yj. □ 

For a positive integer r, we let S'^ denote the cone of r x r symmetric positive semidefinitc matrices 
embedded in R'"(''+i)/2 in such a way that, for all y,z £ the scalar product z^y is the Frobenius product 
of the corresponding matrices. A semidefinite EF (resp. extension) of size r is simply a conic EF (resp. 
extension) w.r.t. C = The semidefinite extension complexity of polytope P, denoted by yiCsDp{P), is 
the minimum r such that P has a semidefinite EF of size r. Observe that (S'^)* = §!j_. Hence, taking 
C S\_ and k := r{r + l)/2 in Theorem [Ml we obtain the following factorization theorem for semidefinite 
EEs. 

Corollary 16. Let P = {x £ R'^ \ Ax ^ b} = conv(y) be a polytope. Then the slack matrix S of P w.r.t. 
Ax ^ b and V has a factorization S = TU so that (T,)^, £ if and only if there exists a semidefinite 
extension Q = {{x,y) £ Mrf+'-('-+i)/2 | Ex + Ey = g, y £ S";^} with P = tTx{Q). 

Analogous to nonnegative factorizations and nonnegative rank, we can define PSD factorizations and PSD 
rank. A rank-r PSD factorization of an to x ri matrix M is a collection of r x r symmetric positive semidefinite 
matrices Ti, . . . , T™ and t/\ . . . , U" such that the Frobenius product {T,, U^) = Tr [{T^yU^] = Tr [T,U^] 
equals Mij for all i £ [to], j £ [n]. The PSD rank of M is the minimum r such that M has a rank-r PSD 
factorization. We denote this by rankp5£)(M). By Corollary [1^1 (and also Lemma [T5|) . the semidefinite 
extension complexity of a polytope P is equal to the PSD rank of any slack matrix of P: xcsjjp{P) = 
rankp5£)(S') whenever 5 is a slack matrix of P. In the next section we will show that rankp5/)(A/) can be 
expressed in terms of the amount of communication needed by a one-way quantum communication protocol 
for computing M in expectation (Corollary [18]). 

5 Quantum Communication and PSD Factorizations 

In this section we explain the connection with quantum communication. This yields results that are inter- 
esting in their own right, and also clarifies where the matrix M of Section [2l came from. 

For a general introduction to quantum computation we refer to |39| and to f38| . and for quantum com- 
munication complexity we refer to |51j and to |5]. For our purposes, an r-dimensional quantum state p is 
an r X r PSD matrix of trace 1 A k-qubit state is a state in dimension r = 2 . If p has rank 1, it can 
be written as an outer product \(t)){4>\ for some unit vector |(^), which is sometimes called a pure state. We 
use \i) to denote the pure state vector that has 1 at position i and Os elsewhere. A quantum measurement 
(POVM) is described by a set of PSD matrices {Ee}0^Q, each labeled by a real number 6, and summing to 
the r-dimensional identity: X^eee ~ ^- When measuring state p with this measurement, the probability 
of outcome 9 equals Tr[Egp]. Note that if we define the PSD matrix E := J2eG0^^6, then the expected 
value of the measurement outcome is X^see ^-^'^ [^ep] = Tr [Ep\. 

^'^For simplicity wo restrict to real rather than complex entries, which docs not significantly affect the results. 
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5.1 Quantum Protocols 



A one-way quantum protocol with r- dimensional messages can be described as follows. On input i, Alice 
sends Bob an r-dimensional state pi. On input j, Bob measures the state he receives with a POVM {Eg} 
for some nonnegative values 9, and outputs the result. We say that such a protocol computes a matrix M in 
expectation, if the expected value of the output on respective inputs i and j, equals the matrix entry Mij. 
Analogous to the equivalence between classical protocols and nonnegative factorizations of M established by 
Faenza et al. [16], such quantum protocols are essentially equivalent to PSD factorizations of S: 

Theorem 17. Let M e R™^" he a matrix. Then the following holds: 

(i) A one-way quantum protocol with r-dimensional messages that computes M in expectation, gives a 
rank-r PSD factorization of M . 

(ii) A rank-r PSD factorization of M gives a one-way quantum protocol with (r + 1)- dimensional messages 
that computes M in expectation. 

Proof. The first part is straightforward. Given a quantum protocol as above, define :~ X^ege^^S' 
Clearly, on inputs i and j the expected value of the output is Tr [pii?-'] — Mij. 

For the second part, suppose we are given a PSD factorization of a matrix Ad , so we are given PSD 
matrices Ti, . . . ,Tm and , . . . satisfying Tr [TiC/-'] ~ Mij for all i,j. In order to turn this into a 
quantum protocol, define r = max^ Tr[ri]. Let pi be the (r + l)-dimensional quantum state obtained by 
adding a (r + l)st row and column to Ti/r, with 1 — Tr \Ti] /r as (r + l)st diagonal entry, and Os elsewhere. 
Note that p,; is indeed a PSD matrix of trace 1, so it is a well-defined quantum state. For input j, derive 
Bob's (r + l)-dimcnsional POVM from the PSD matrix as follows. Let A be the largest eigenvalue of 

, and define E^.^ to be U'-' /\, extended with a (d + l)st row and column of Os. Let Eq = I — E!^^. These 
two operators together form a well-defined POVM. The expected outcome (on inputs i,j) of the protocol 
induced by the states and POVMs that we just defined, is 



tXTt 



Ei 



^Tr [T,W] =My, 

so the protocol indeed computes M in expectation. □ 

We obtain the following corollary which summarizes the characterization of semidefinite EFs: 
Corollary 18. For a polytope P with slack matrix S , the following are equivalent: 

(i) P has a semidefinite extension Q ~ {{x, y) £ | Ex + Fy ~ g,y £ 

(ii) the slack matrix S has a rank-r PSD factorization; 

(Hi) there exists a one-way quantum communication protocol with (r -f 1) -dimensional messages (i.e., us- 
ing [log(r -I- 1)] qubits) that computes S in expectation (for the converse we consider r-dimensional 
messages). 



5.2 A General Upper Bound on Quantum Communication 

Now, we provide a quantum protocol that efficiently computes a nonnegative matrix M in expectation, 
whenever there exists a low rank matrix N whose entry-wise square is M. The quantum protocol is inspired 
by Ha Section S.sH 

Theorem 19. Let M be a matrix with nonnegative real entries, N he a rank-r matrix of the same dimensions 
such that Mij — Nfj. Then there exists a one-way quantum protocol using (r -f \)- dimensional pure-state 
messages that computes M in expectation. 

^^The proof is somewhat similar to, but independent of Zhang |56l Theorem 4.2], which is about the quantum communication 
complexity of correlations. 
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Proof. Let TV" = C/EV be the singular value decomposition of the transpose of iV; so U and V are unitary, 
I] is a matrix whose first r diagonal entries are nonzero while its other entries are 0, and (j|/7El/^|i) = Nij. 
Define = Since only its first r entries can be nonzero, we will view [(jji) as an r-dimensional 

vector. Let — and A = max^ A^. Add one additional dimension and define the normalized (r + 1)- 
dimensional pure quantum states = (|</'i)/A, i/l — Af/A^). Given input i, Alice sends to Bob. 
Given input j, Bob applies a 2-outcome POVM {£'^2,^0 = ^ ^a^} where £"^2 is the projector on the 
pure state U*\j) (which has no support in the last dimension of If the outcome of the measurement 

is £"^2 then Bob outputs A^, otherwise he outputs 0. Accordingly, the expected output of this protocol on 
input is 

A^ Pr[outcome £^2] = A2(V',|£^2 iV-*) = (0^|£i2|0^) 
= \{j\Um' = \{j\Um^)\' = Nf^ = M,,. 

The protocol only has two possible outputs: and A^, both nonnegative. Hence it computes M in expectation 
with an (r + l)-dimensional quantum message. □ 

Note that if M is a 0/1-matrix then we may take N ~ M. hence any low-rank 0/1-matrix can be 
computed in expectation by an efficient quantum protocol. We obtain the following corollary (implicit in 
Theorem 4.2 of Gouveia et al. [21]) which also implies a compact (i.e., polynomial size) semidefinite EF for 
the stable set polytopc of perfect graphs, reproving the previously known result by Lovasz |34 | l35 j . We point 
out that the result still holds when dim(P) + 2 is replaced by dim(P) + 1, see [12]. (This difference is due 
to normalization.) 

Corollary 20. Let P he a polytope such that S{P) is a 0/1 matrix. Then xcsdp{P) ^ dim(P) + 2. 

5.3 Quantum vs Classical Communication, and PSD vs Nonnegative Factoriza- 
tions 

We now give an example of an exponential separation between quantum and classical communication in 
expectation, based on the matrix M of Section [51 This result actually preceded and inspired the results in 
Section [3| 

Theorem 21. For each n, there exists a nonnegative matrix A/ 6 that can be computed in expectation 

by a quantum protocol using logn + 0(1) qubits, while any classical randomized protocol needs 0(n) bits to 
compute M in expectation. 

Proof. Consider the matrix N £ M."^ ^'^ whose rows and columns are indexed by n-bit strings a and b, 
respectively, and whose entries are defined as N^b = I ~ a'^b. Define M G R^"^^" by Mat = N^b- This 
M is the matrix from Section [2] Note that N has rank r ^ n + 1 because it can be written as the sum of 
n + l rank-1 matrices. Hence Theorem [19] immediately implies a quantum protocol with (n + 2)-dimensional 
messages that computes M in expectation. 

For the classical lower bound, note that a protocol that computes M in expectation has positive proba- 
bility of giving a nonzero output on input a, b if and only if Mab > 0. With a message m in this protocol we 
can associate a rectangle Rm = A x B where A consists of all inputs a for which Alice has positive proba- 
bility of sending m, and B consists of all inputs b for which Bob, when he receives message m, has positive 
probability of giving a nonzero output. Together these rectangles will cover exactly the nonzero entries of 
M. Accordingly, a c-bit protocol that computes M in expectation induces a rectangle cover for the support 
matrix of M of size 2^. Theorem [T] lower bounds the size of such a cover by 2"("), hence c = 17(n). □ 

Together with Theorem [17] and the equivalence of randomized communication complexity (in expecta- 
tion) and nonnegative rank established in |16j . we immediately obtain an exponential separation between 
nonnegative rank and PSD rank. 

Corollary 22. For each n, there exists M E ""^"^ with rank+(i\/) = 2^(") and rankpsD(A/) = 0{n). 
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In fact a simple rank-(?i + 1) PSD factorization of M is the following: let Ta := {la){^a}'^ and := 
Q {ly, then Tr[{Tay U'] = {1 - b)^ ^ Mat- 

6 Concluding Remarks 

In addition to proving the first unconditional super-polynomial lower bounds on the size of linear EFs for 
the cut polytope, stable set polytope and TSP polytope, we demonstrate that the rectangle covering bound 
can prove strong results in the context of EFs. In particular, it can be super-polynomial in the dimension 
and the logarithm of the number of vertices of the polytope, settling an open problem of Fiorini et al. jl8j . 

The exponential separation between nonnegative rank and PSD rank that we prove here (Theorem I2ip 
actually implies more than a super-polynomial lower bound on the extension complexity of the cut polytope. 
As noted in Theorem[5l the polytopes CUT(n) and COR(n — 1) are affinely isomorphic. Let Q{n) denote the 
polyhedron isomorphic (under the same affine map) to the polyhedron defined by ^ for a € {0, 1}". Then 
(i) every polytope (or polyhedron) that contains CUT(n) and is contained in Q{n) has exponential extension 
complexity; (ii) there exists a low complexity spectrahedron that contains CUT(n) and is contained in Q{n). 
(A spectrahedron is any projection of an affine slice of the positive semidefinite cone.) 

An important problem also left open in [55| is whether the perfect matching polytope has a polynomial- 
size linear EF. Yannakakis proved that every symmetric EF of this polytope has exponential size, a striking 
result given the fact that the perfect matching problem is solvable in polynomial time. He conjectured that 
asymmetry also does not help in the case of the perfect matching polytope. Because it is based on the 
rectangle covering bound, our argument would not yield a super-polynomial lower bound on the extension 
complexity of the perfect matching polytope. Even though a polynomial-size linear EF of the matching 
polytope would not prove anything as important as P 7^ NP, the existence of a polynomial-size EF or an 
unconditional super-polynomial lower bound for it remains open. 

We hope that the new connections developed here will inspire more research, in particular about ap- 
proximate EFs. Here are two concrete questions left open for future work: (i) find a slack matrix that 
has an exponential gap between nonnegative rank and PSD rank; (ii) prove that the cut polytope has no 
polynomial-size semidefinite EF (that would rule out SDP-based algorithms for optimizing over the cut 
polytope, in the same way that this paper ruled out LP-based algorithms). 

Our final remark concerns the famous log-rank conjecture |36| . It states that the deterministic com- 
munication complexity of a (finite) Boolean matrix M is upper bounded by a polynomial in the loga- 
rithm of its rank rank(M). On the one hand, this conjecture is equivalent to the following statement: 
log(rank+(M)) ^ polylog(rank(A/)) for all Boolean matrices M. On the other hand, we know that 
rankp5£)(M) = 0(rank(M)) for all Boolean matrices M by Theorem [T^ Using the interpretation of the 
nonnegative and PSD rank of M in terms of classical and quantum communication protocols computing M 
in expectation (see |16| and Theorem 1 17|) . we see that the log-rank conjecture is equivalent to the conjecture 
that classical protocols computing M in expectation are at most polynomially less efficient than quantum 
protocols. Accordingly, one way to prove the log-rank conjecture would be to give an efficient classical sim- 
ulation of such quantum protocols for Boolean M (for non-Boolean M , we already exhibited an exponential 
separation in this paper). 
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A Background on Polytopes 

A (convex) polytope is a set P C that is the convex hull conv(y) of a finite set of points V . Equivalently, 
P is a polytope if and only if P is bounded and the intersection of a finite collection of closed halfspaces. 
This is equivalent to saying that P is bounded and the set of solutions of a finite system of linear inequalities 
and possibly equalities (each of which can be represented by a pair of inequalities). 

Let P C R'' be a polytope. A closed halfspace that contains P is said to be valid for P. In this 
case the hyperplane H that bounds is also said to be valid for P. A face of P is either P itself or the 
intersection of P with a valid hyperplane. Every face of a polytope is again a polytope. A face is called 
proper if it is not the polytope itself. A vertex is a minimal nonempty face. A facet is a maximal proper 
face. An inequality c^x ^ 5 \s said to be valid for P if it is satisfied by all points of P. The face it defines is 
F :~ {x £ P \ c^x = S}. The inequality is called facet- defining if P is a facet. The dimension of a polytope 
P is the dimension of its affine hull aff(P). 

Every (finite or infinite) set V such that P = conv{V) contains all the vertices of P. Conversely, letting 
vert(P) denote the vertex set of P, we have P = conv(vert(P)). Suppose now that P is full- dimensional, 
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i.e., dini(P) = d. Then, every (finite) system Ax ^ b such that P ^ {x G M."^ \ Ax ^ b} contains all 
the facet-defining inequalities of P, up to scaling by positive numbers. Conversely, P is described by its 
facet-defining inequalities. 

If P is not full-dimensional, these statements have to be adapted as follows. Every (finite) system 
describing P contains all the facet-defining inequalities of P, up to scaling by positive numbers and adding 
an inequality that is satisfied with equality by all points of P. Conversely, a linear description of P can be 
obtained by picking one inequality per facet and adding a system of equalities describing aff(P). 

A 0/1-polytope in M.'^ is simply the convex hull of a subset of {0, 1}''. 

A (convex) polyhedron is a set P C R'' that is the intersection of a finite collection of closed halfspaces. 
A polyhedron P is a polytope if and only if it is bounded. 

For more background on polytopes and polyhedra, see the standard reference [57] . 
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